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Abstract 

We analyze the properties of model food webs and of fifteen community food webs from a 
variety of environments — including freshwater, marine-freshwater interfaces and terrestrial 
environments. We first perform a theoretical analysis of a recently proposed model for food 
webs — the niche model of Williams and Martinez ( 2000). We derive analytical expressions for 
the distributions of species' number of prey, number of predators, and total number of trophic 
links and find that they follow universal functional forms. We also derive expressions for a 
number of other biologically relevant parameters which depend on these distributions. These 
include the fraction of top, intermediate, basal, and cannibal species, the standard deviations 
of generality and vulnerability, the correlation coefficient between species' number of prey and 
number of predators, and assortativity. We show that our findings are robust under rather general 
conditions; a result which could not have been demonstrated without treating the problem ana- 
lytically. We then use our analytical predictions as a guide to the analysis of fifteen of the most 
complete empirical food webs available. We uncover quantitative unifying patterns that describe 
the properties of the model food webs and most of the trophic webs considered. Our results sup- 
port a strong new hypothesis that the empirical distributions of number of prey and number of 
predators follow universal functional forms that, without free parameters, match our analytical 
predictions. Further, we find that the empirically observed correlation coefficient, assortativity, 
and fraction of cannibal species are consistent with our analytical expressions and simulations 
of the niche model. Finally, we show that two quantities typically used to characterize com- 
plex networks, the average distance between nodes and the average clustering coefficient of the 
nodes, show a high degree of regularity for both the empirical data and simulations of the niche 
model. Our findings suggest that statistical physics concepts such as scaling and universality 
may be useful in the description of natural ecosystems. 
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1 Introduction 



In ecosystems, spe cies are conn ected through i ntricate trophic relations hips which define corn - 
plex food webs (B riandand Cohen. 1984: .Cohen etal... .1990; Williams and Martine?! hOOd . 
Understanding the structure an d mechanisms underlying the formation of these complex webs is 
of great importance in ecology ( Rejmanek and Starv , 19791: Briand and Cohen , 1987t Cohen et al. 



199n[ iPolisL 1199 Ir) . In particular, food web structure provides insights into the behavior of 



ecosystems under perturbations such as the introduction of new species or the extinction of 
existing ones. It is thought that the nonlinear response of the elements composing the net- 
work may le a d to p o ssibly ca t astroph ic outcomes for even small perturbations (Berlow, 1999; 



Chapin et all ilOOdl : iMcKannL llOOd) . Thus, an understanding of the topology and assembly 



mechanisms of food webs may be of great practical interest with regard to crop selection, man- 
agement of fishing stocks, preservation of threatened ecosystems, and maintenance of biodiver- 
sity (Pimm et al... ,l 99.5.: .Stone. J 999: .Covington. ^OOOi Hutchings. .20001 



Due to the importance of food webs, much effort has been put in to collecting empirical data 
and uncovering unifying patterns whi ch describe t heir structure (Reimanek and Starv", '19791; 
LBriand and Cohen, 1984 1987; Lawton a nd WarrenL [l988: Cohen et al, 1990; Hall and Raffaelll 
fl99ll: IPimmet ail Il99ll: IPolisL 1 1 99 1[ [Martinez . 1993). However, in the last decade the con- 
struction of larger and more complete food webs has cle arly demonstrated that the previously 
repor ted unifying patterns do not hold for the new webs (iHall and RaffaelliL [l99llll993l: IPolisl 



1991h . Indeed, the complexity of the new webs has rendere d it quite c hallenging to obtain quan 



titative patterns with which to "substitute" the old ones (.Dunne et al.Ll2002i) 



The purpose of this work is twofold. First, to perform a theoretical analysis of a recently pro- 
posed model for food webs — the niche model of Williams and Martinez (2000). Second, to use 
the predictions of the model as a guide to a systematic statistical analysis of community food 
webs from a variety of environments. 



Remarkably, we uncover quantitative unifying patterns that describe the properties of most of 
the diverse trophic webs considered and capture the random and non-random aspects of their 
structure. Specifically, we find that several quantities — such as the distributions of number of 
prey, number of predators, and number of trophic links — characterizing these diverse food webs 
obey robust functional forms that, as predicted by our analytical results for the niche model, 
depend on a single parameter — the linkage density of the food web. 



The organization of the paper is as follows. In Section 2 we study the niche model proposed 
by Williams and Martinez (2000) analytically and numerically. In Section 3 we analyze the 
empirical food webs and show the existence of robust quantitative patterns. Finally, in Section 
4 we present some concluding remarks. 



Patterns in food web structure, J. Camacho et al. 



4 



2 Analytical solution of the niche model 



Recently, Williams and Martinez (|200n|) have proposed a model for food web structure — the 
niche model — that with just a couple of ingredients appears to successfully predict key structural 
properties of the most comprehensive food webs in the literature. Numerical simulations of the 
niche model predicted values for many quantities typically used to characterize empirical food 
webs that are consistent with measured values for seven empirical webs. 

In this Section, we systematically investigate the niche model from a theoretical perspective. 
We obtain analytical expressions for a number of quantities characterizing the structure of food 
webs in the limit of sparse food webs, i.e., webs with L <^ S"^, where L is the number of 
trophic interactions between species and S is the number of species in the web. This limit is the 
one of interest in ecology because (i) for most food webs reported in the literature the directed 
connectance, defined as C = L I S"^, take value s much smaller than one, and (ii) i t corresponds 
to the limit of large web sizes S (jBriand and Cohen. 1984 . 1987; Sole et al. , 2002 ), which is the 
interesting limit if one surmises that geographically separate ecosystems are in fact connected. 



We first calculate the probability distributions of number of prey and of number of predators 
and find that for C <^ 1 they depend only on one parameter of the model — the linkage density 
z = L/S , i.e. the average number of prey o r predators. These distributions give valuable 
information about the structure of the network ( Albert et al. . 2000|) and enable us to calculate 
other interesting quantities such as the fraction of "top," "intermediate," and "basal" species, 
and the standar d deviation of the "vulnerability" and "generality" of the species in the food web 
dWilliams and Martinez. .2000) . We also calculate the correlation coefficient between number of 
prey and number of predators, the fraction of cannibals present, and two additional properties 
of interest in the characterization of complex networks: the a verage "distance" between species 
and the local redundancy of the connection between species ( Watts and Strogat3. 1998). 



2.1 The niche model 



Consider an ecosystem with S species and L trophic interactions between these species. These 
species and interactions define a network with S nodes and L directed links. In the niche model, 
one first randomly assigns 5* species to "trophic niches" with niche values Ui mapped uniformly 
onto the interval [0,1]. For convenience, we will assume that the species are ordered according 
to their niche number, i.e., ni < ^2 < ... < ns. 

A species i is characterized by its niche parameter Ui and by its list of prey. Prey are chosen 
for all species according to the following rule: A species i preys on all species j with niche 
parameter uj inside a segment of length centered in a position chosen randomly inside the 
interval [aj/2,nj], with Oj = xui and < x < 1 a random variable with probability density 
function 

p,{x)=b{l-xf-'^ . (1) 
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Williams and Martinez (■200(7) appeared to have chosen this functional form for convenience, 
but, we will show later that the predictions of the model are mostly robust to changes in the 
specific form of p^. 

The values of the parameters h and 5* determine the linkage density z = L/S of the food web, 
and the directed connectance C = L/S"^. Since the species are uniformly distributed along the 
segment [0, 1], one can express the average number of prey per species as Sa, where the bar 
indicates an average over all species in the web. It then follows that the linkage density is 

z = Sa (2) 



and the connectance is 

C = a. (3) 



One can also obtain these expressions in terms of b by noting that n and x are independent 
variables, namely 

a = nx = — ; — , (4) 

2(1 + 6) ' ^ ^ 



where n = 1/2 and x = 1/ {1 + b). 

In the niche model, isolated species — that is, species with no prey or predators — are eliminated, 
and species with the same list of prey and predators — that is, trophically-identical species — are 
"merged." For C = a -C 1, the probability that two species are trophically identical is very 
small. This s uggests that taxonom ic and trophic classifications of species lead to similar results 



in this limit dwilliams and Martinez! l2000l) . 



In the following subsections we derive analytical expressions that enable us to predict the prop- 
erties of the webs generated by model. 



2. 2 Distribution of number of prey 



For large S, the number of prey of species iis ki = Sai, so that the probability of having k prey 
Pprey is given directly by the distribution of a. Specifically, 

Ppreyik) = p{a)/S . (5) 



In order to evaluate ]9(a), let us note that a is the product of two independent stochastic variables, 
n and x, both ranging between and 1. As illustrated in Fig. 1 , it then follows that the cumulative 
probability P{a' > a) = da'p{a') is the probability that a pair of random values (n, x) fall 
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Niche number, n 

Fig. 1. Calculation of the distribution of segment lengths a. Each point inside the n — x diagram yields 
a value a = nx. The hyperbole nx = const divides the n — x plane in two regions, which we color 
grey and white. The region R, colored in grey, contains all the points for which a > 0.5. Therefore, the 
probability of drawing a pair (n, x) belonging to region R can be calculated using Eq. (6). 



in the region R of the n — x diagram bounded by the lines x 

a = nx, 



1, n = 1, and the hyperbole 



P{a > a) = J da'p{a') 

dx dnpn{n) Px{x) 



R 

1 1 
dx J dnpn{n)px{x) 

<^ a/x 



(6) 



where Pnin) = 1 is the probability density function for n. The integration of Eg. (6 ) gives rise 
to a closed form involving hypergeometric functions ( Gradstheyn and Ryzhik . 200fl|) . 



However, we can obtain a simpler analytical solution for p(a) than that given by the hypergeo- 
metric functions as follows. In the limit C <^ I, which implies b ^ 1, Px{x) is negligible except 
when a; ^ 1 . We can then approximate p,j. as 



Px{x) =b{l - x) 



(6-1) 



be 



-bx 



(7) 



in the entire x-range and expect the results to remain unchanged in the limit C <^ 1. Under this 
approximation, the solution of Eq. (6) yields 



p{a) = bEiiba) , 



(8) 
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Fig. 2. (A) Linear and (B) log-linear plots of the distribution of number of prey for 1000 simulations 
of food webs with S = 1000. We show results for z = 5 and 10 and the corresponding theoretical 
predictions. As predicted by Eq. (9), we find an exponential decay of the distributions. (C) Log-linear 
plot of the probability densities for the scaled number of prey k/2z for finite C. We show results for 
C = 0.05,0.1 and 0.2, for 50000 webs generated using the niche model. Dashed lines (solid Unes) 
correspond to simulations with S = 100 {S = 1000) for the appropriate values of z (namely z = 5, 10, 
and 20, for S = 100, and z = 50, 100, and 200 for S = 1000). The thick solid line is the theoretical 
prediction in the limit C 0. One observes the collapse of distributions having the same C but different 
S, similar to what we derived analytically for C — > 0. Note that the decay is faster than exponential for 
finite C. 



where Ei (x) = dtt ^ exp ( — t) is the exponential-integral function (iGradsthevn and Rvzhik , 
200nl) . The probability distribution Pprey(^) is obtained from Eq. (8) making the substitutions 
a = k/S and b = S/2z — which are valid in the limit C <^ 1 — yielding 



Pp,ey(A;) = (1/2^) E,ik/2z). 



(9) 



In Figs. 2(A) and (B) we compare the predictions of Eq. (9) with numerical simulations of the 
model. We find close agreement between our analytical expression and the numerical results. 
In particular, Pprey shows an exponential decay for large k. The deviations observed for small 
values of k are due to the fact that kj = Saj is a good approximation only when the fluctuations 
of kj are small, which is not true for small k. 

Equation (9) depends only on the scaled number of prey k/2z. Thus, for any value of z, the 
scaled variable k = k/2z obeys the same probability density function, 

p^,,yik) = EiCk) . (10) 



This probability density function is therefore universal, i.e., it is identical for any values of S 
and z provided that C is small. For finite C, Pprej has a truncation of the exponential decay at a 
value of k that is a decreasing function of C. Remarkably, even for finite C, Pp^ey only depends 
on C; cf. Fig. 3(A). 
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Fig. 3. (A) Linear and (B) log-linear plots of the distribution of number of predators for 1000 simula- 
tions of food webs with S = 1000. We show results for z = 5 and 10 and the corresponding theoretical 
predictions. As predicted by Eq. (14), we find a regime where the distribution is approximately uniform 
followed by a Gaussian decay. (C) Log-linear plot of the analytical expressions for the probability den- 
sities for the scaled number of predators m/2z for three values of z. For all values of z, we find an 
approximately constant value of Ppred for m/2z < 1 (shaded region) and a fast Gaussian-like decay for 
m/2z > 1. 

2. 3 Distribution of number of predators 



For C the predators of species i have niche values n.j > Ui, and the segment aj is placed 
with equal probability in the interval [0, nj]. Therefore, the probability that a species j preys on 
i, provided nj > nj, is 

aj/^j^Xj^j/^j^Xj, (11) 



implying that the average probability that a species with rij > rii preys on species iis x. 

If one notes that species i has {S — i) potential predators — those species with rij > rii — then 
it follows that, in the limit C <S 1, the total number of predators of i is the result of (5" — 
i) independent "coin-throws" with probability a; of a given species j, with Uj > rii, being a 
predator. This fact implies that the probability of species i having m predators is given by the 
binomial distribution, 

Ppred(^) = - ^f"^~' (12) 

It then follows that the distribution of number of predators for the set of all species in the web 
is an average over the binomials for the different values of i 

1 S-m 1 S-m f C _ j\ 

PvUm) = ;^ E pU(^) = ;s E ( ^ J^"(l - ^)'""^"^ (13) 
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In the limit of interest, 5^1, and with Sx = 2z, one can approximate the binomial distribution 
by a Poisson and the sum by an integral, yielding 



2z , 

1 [ t"^e 1 

Ppvedim) = — dt — = — 7(m + l,2z), (14) 

2z J ml 2z 





where 7 is the "incomplete gamma function" ( Gradstheyn and Rvzhik . 2000[) . For m < 2z, 



Pprcd is approximately constant because 7(m + 1, 2z) ~ 1, while for m> 2z Pprcd decays with 
a Gaussian tail. In Figs. 2(A) and (B), we compare the predictions of Eq. (14) with numerical 
simulations and find good agreement. 

Unlike the scaling seen for the distribution of number of prey, Eq. (14) is not simply a function 
of the scaled variable 'm/2z. However, for small values of m/2z, 7 is a constant and thus it does 
not depend on m or z. So, the probability density for the scaled variable fn = ■m/2z, 

Ppred(?^) = 7(2-2 m + 1, 22;) ^ 1 m<l (15) 



for any z. For m > 1, p^redijn) decays as a Gaussian. In Fig. 3(B) we show that the decay rate 
of Ppred increases with z, becoming a sharp step drop at m = 1 as 2; — 00. 



2.4 Distribution of number of trophic links 



The number r of trophic links of a species is the sum of the number of prey and number of 
predators of that species. In Fig. 4, we display the probability densities piinks(^ = r /2z) obtained 
from numerical simulations for S = 100 and S = 1000. Both distributions show a maximum 
around f = 1 and the same decay in the tail as the distribution of number of prey, i.e. an 
exponential decay for small C, and faster than exponential decay for larger C values. This 
similarity is expected since the distribution of number of predators decays very rapidly, so that 
large values of r must arise due to large values of k. 

Obtaining an analytical solution for piinks(^) is considerably more difficult than for Pprcy or 
Ppred because it requires one to estimate the correlations between the number of prey and the 
number of predators. The rules underlying the niche model imply that the number of predators 
is negatively correlated with the number of prey. That is to say, a species having a large number 
of predators is likely to have a low niche number, and thus it will likely predate upon few or no 
species. This attribute of the niche model is discussed in more detail in Section 2.7. 

If one neglects, for the moment, correlations between the distribution of number of prey and 
number of predators, the probability density for the scaled total number of links, f = r/2z, is 
found simply by the convolution of both distributions. In the limit C ^ 1, one has 
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Scaled number of links 

Fig. 4. (A) Linear and (B) log-linear plots of the the probability densities for the scaled number of links, 
f = rjlz. We show results for 2; = 10 and web sizes S = 100 (dotted Une) and S = 1000 (dashed 
line). The two curves show a similar behavior: A maximum around f = 1, and a tail dominated by the 
distribution of number of prey. For S = 1000, one has C = 0.01 and the decay is exponential. For 
S = 100, C = 0.1 and the decay is faster than exponential (see Fig. 2). We also show two analytical 
curves in the limit C 0: the convolution, Eq. (16), for 2; = 10 (thick solid line) and for z = 00 (thin 
soUd Une). As with the simulations, the theoretical curves also display a maximum at f «i 1. The decay 
for both analytical curves is exponential, as is the case for the distribution of number of prey for C ^ 0. 



Plmks(^^) = J Ppvey{t)Ppved{r - t) dt 


f 

= j Ei{t)-f{2z{r -t) + l, 2z) dt (16) 




This integral cannot be expressed in terms of recognizable functions, but can be calculated 
numerically. In Fig. 4 we plot the prediction of Eq. (16) for 2; = 10. As we obtained for the 
simulations of the model, we find a maximum of punks around f — 1 and an exponential decay 
in the tail. The latter arises from the exponential tail found for the distribution of number of 
prey, as previously mentioned. 

One can also obtain an explicit expression for punks when z ^ 00. One expects this solution 
not to differ much from that for a finite z, since the only difference is that a smooth step in 
the distribution of the number of predators around m = 1 is substituted by a sharp one. As 
discussed in the previous section, the limits of 2; — > 00 and C <^ 1 yield 
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1 if m < 2z, 

Ppred("^) = { (17) 

if m > 2z. 
Integration of Eq. (16) then yields 



, l-E2(r) if r <1, 

Piinks(r) = <! (18) 

E2(f- 1) -E2(f) if f>l. 



with . E^fx) = l—exp{—x)+xEi{x) the exponential-integral of order 2 dGradsthevn and Ryzhik . 



200d) . Figure 4 also displays Eq. (18), showing a curve rather similar to the convolution for 



z = 10 except close to r = 1. 

From the analytical solution, one can demonstrate that the tail of punks decays exponentially. For 
large f, Eq. (18) can be approximated by 



E^if - 1) - E2{f) - -E'^{r - 1/2) 
= Ei(f-l/2) 
exp{—r + 1/2) 
f - 1/2 



(19) 



where E'2 = —Ei is the derivative of function E2 a nd the last expression is the dom inant term 
in the expansion of exponential-integral functions ( Gradsthevn and Rvzhik . 2000h . Note that 
Eq. (19) is a good approximation of Eq. (18) for f > 2. 



2.5 Fraction of top, intermediate, basal and cannibal species 



As the names indicate, top species have no predators and basal species have no prey, while 
intermediate species are those with both prey and predators. The fraction T of top species is, by 
definition, 

r-Pp,..(0) = i^^3^, (20) 



as directly seen from Eq. (14). Typically empirical webs have 2 < 2^; < 20, a region for which 
Eq. (20) can be approximated simply as T = l/2z. 

To calculate the fraction B of basal species, we note that a species has no prey only if its range 
a falls in a region with no species. In the limit of large S, the probability density for fin ding a n 



empty interval of length 5 is 5*6 , as predicted by the canonical distribution (Pathria. ll972h . 
Thus, the probability of finding a species-free segment of length larger than a is e~'^", which 
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Fig. 5. Fraction of top and basal species as a function of the linkage d ensity z. The shade d region corre- 
sponds to the interval of z typically observed in empirical food webs dPunne et al 1'2002'). (A) Compar- 
ison of the results for 1000 simulations of food webs for which isolated species were removed with our 
theoretical prediction Eqs. (20), solid line, and (22), dashed line, for fraction of top species. (B) Compar- 
ison of the results of 1000 simulations of food webs for which isolated species were removed with our 
theoretical prediction Eqs. (21), solid line, and (22), dashed line, for fraction of basal species. Note that 
for both T and B there is good agreement between the analytical expressions and the numerical results 
when C ^ 1. Also note that the theoretical predictions provide narrow bounds for the numerical results 
in this limit. 



yields the probability for a species with a range a not to prey on other species. Using Eq. (8), it 
follows that the fraction of bottom species is: 



1 

B = j da e~^>(a) 



ln(l + 2z) 
2^ ' 



(21) 



Note that we must take this approach to the calculation because the definition of the fraction B 
of basal species, j9prcy(0), leads, in the continuous analysis, to a divergence as the exponential- 
integral function Ei{x) diverges in the limit x — 0. 

In the niche model, isolated species are eliminated, so they are not counted toward top and basal 
species. To correct Eqs. (20) and (21) for this effect, we must account for the isolated species 
in our calculations. We estimate the number of isolated species to first order by assuming that 
having no prey is statistically independent of having no predators, implying that the fraction of 
isolated species is just the product of the fractions of top and basal species. This assumption 
does not take into account the (negative) correlations between the number of prey of a species 
and its number of predators. Nonetheless, this simple approximation provides an upper bound 
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Fig. 6. Fraction of cannibals as a function of the connectance C. We plot the fraction of cannibal species 
averaged over 5000 realizations of the model for two system sizes, and the predictions of Eqs. (23) and 
(24). It is visually apparent that the results are insensitive to changes in S and that they ai^e in good 
agreement with our theoretical predictions. The shaded area indicates one standard deviation from the 
average for S = 25; the upper and lower curves have been obtained using Eqs. (26) and (24). Our results 
demonstrate that in the niche model the fraction of cannibals depends only upon the connectance. 

for the number of isolated species which leads to lower bounds on T and B, 



r 



T 



TB 
~TB' 



B' 



B-TB 
1-TB 



(22) 



In Fig. 5, we compare our analytical predictions for the fraction of top and basal species with nu- 
merical simulations of the model. As expected, Eqs. (20-22) provide bounds for the numerical 
results in the limit C <^ 1. 

The fraction of intermediate species is just J = 1 — (T + B). 

Another quantity of ecological interest is the fraction of cannibal species. Consider a species i 
with niche number n,. Its preys are the species inside an interval of length = xrii, chosen 
according to Eq. (1), and centered with uniform probability in the interval [aj/2, Ui]. For i to 
predate on itself, it is necessary that the segment of length contains species i. This is equiv- 
alent to the distance between the center of the segment and Ui being smaller than aj/2, which 
occurs with probability y /(^j — Therefore, the average probability g for a species to be a 
cannibal is given by 

6(1 - xf-'^^dx = 2Fi(l, 1; 2C; 1/2) - 1 (23) 
1 — x/2 



J 1 1 1 

where 2-F1 is the Gauss hypergeometric function (IGradsthevn and RvzhikL 12000). One thus finds 



that, for any value of S, the fraction of cannibals depends only upon C. Equation (23) can be 



Patterns in food web structure, J. Camacho et al. 



14 



expanded for small C as 

^ ~ C + 2C2 + 0(C3). (24) 



Since the largest value of C reported in the literature is 0.3, we expect Eq. (24) to provide a 
good approximation for empirically relevant values of C (Fig. 6). 

Next, we evaluate the standard deviation oi g.\i ^ denotes the number of cannibal species in a 
single realization of the model with S species, the corresponding fraction of cannibals is simply 
gi = i/S. Therefore, the standard deviation of ge is cr^ = Ai/S. Since g is the probabilitiy 
for a species to be a cannibal, the probability of having exactly £ cannibal species is given by a 
binomial distribution 

Pcan{i) = (^,]g\i-gf-'. (25) 



Therefore, = Sg{l — g), and the standard deviation for the fraction of cannibalism is 



a, = W— (26) 



This expression implies that ag decreases with increasing S. 
2.6 Standard deviation of generality and vulnerability 



The vulnerability of a prey is defined as its number m of predator s, and the generality of a 
predator as its number k of prey. Following Williams and Martinez we define the nor- 



malized standard deviations of the vulnerability d& ay = m? /m? — 1 and of the generality as 



2 

aQ = k"^/ k — 1. By definition, one has rn = k = z. 

To evaluate ay, we first calculate m^. Equation (14) yields m? = 4z^/3 + z, so that 



.. = \/i + i. (27) 



which approaches 1/3 as 2: — > 00 (Fig. 7). Since Eq. (14) is valid only in the limits S ^ 1 
and C ^ 1, these are also the limits of validity for Eq. (27). We present these results, as well as 
simulations of the niche model in Fig. 7(A). 

We next calculate ac by direct evaluation of A;^ . if 5* ^ 1^ the number of prey of a species 
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Fig. 7. Normalized standard deviations of generality and vulnerability as a function of the average link- 
age d ensity z. The shaded region corresponds to the interval of z typically observed in empirical food 
webs (I Punne et al. . 2002 ). (A) Comparison of the results for 1000 simulations of food webs for which 
isolated species were removed with our theoretical prediction Eq. (27) for the standard deviation of the 
vulnerability. (B) Comparison of the results for 1000 simulations of food webs for which isolated species 
where removed with our theoretical prediction Eq. (29) for the standard deviation of the generality. Note 
that Eq. (27) is only valid in the limits of 5 » 1 and C <C 1, which is confirmed in (A) by the deviations 
found for S = 100. Equation (29) was derived independent of these limits and thus holds for any value 
of C. The small underestimation of our expression for ac relates to the fact that kj = Srj is a good 
approximation only when the fluctuations of kj are small, which is no longer true for small k. 



having a range a is = Sa, yielding 



A;2 



+ 11 



n X 



3(6 + 2)' 



(28) 



Thus, ac is 



3 1 + 2C 



- 1. 



(29) 



For C <^ I, ctg = y 5/3, a result that can also be obtained from Eq. (9). Unlike Eq. (27), 
Eq. (29) depends upon C and thus does not require the same limits considered in the derivation 
of Eq. (27). We show these results as well as those for numerical simulations of the niche model 
in Fig. 7(B). 
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Fig. 8. (A) Distribution of rcor, the correlation coefficient between number of prey and number of preda- 
tors obtained from 5000 realizations of the niche model. We show simulation results for z = 10. (B) 
Comparison of the average correlation coefficient obtained from 5000 simulations of the model (for 
which isolated species where removed) with our theoretical prediction, Eq. (39). The negative correla- 
tions observed are inherent to the model, as a species having a large number of predators is likely to have 
a low niche number, and consequently will predate upon few or no species. 

2. 7 Correlation coefficient 



The mechanisms by which webs are constructed in the niche model imply that a species with 
many predators is likely to have a low niche value. Furthermore, a species with a low niche value 
probably feeds on few prey. As a consequence, the more predators a species has, the smaller the 
number of prey will be, and vice versa. Thus, the number of prey of a species and its number of 
predators must be negatively correlated. 

In order to quantify the correlations between k and m we define the correlation coefficient 
between the number of prey and the number of predators. 



km — km 



\J k'^ — \J m^ — m^ 



(30) 



where, as before, the bar over a variable denotes an average over all species in a web. 

We evaluate the average correlation coefficient, rcor, for the niche model in the limits 5" ^ 1 
and C <C 1. Equation (30) can be rearranged to yield 



km/z^ 



(TqUv 



(31) 
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where ac and ay are the normalized standard deviations of the generality and the vulnerability 
respectively. Thus, in order to obtain a closed form for Eq. (31), we need only to calculate 

s 

km = ^ km p{k, m) , (32) 

k,m=0 



where p{k, m) is the probability that a species has k prey and m predators. If k and m were 
independent, p{k, m) would be simply the product Pprey(A;)ppred(^)> yielding 

s s _ 

km^Yl ^Pprey{k) "^Ppred ("T.) ^ km ^ (33) 
fe=0 m=0 

and Tcorr = 0. However, k and m are not independent. 

Let us define p{k, m, n) as the probability that a species has k prey, m predators, and niche 
number n. One then has 

p{k, m, n) — p{k\m, n)p{m\n)pn{n) . (34) 
where p{\) refers to the conditional probability. 

Remembering that p„(n) = 1 and noticing that j9(A;|m, n) = p{k\n) because the number of prey 
in the niche model is determined solely by the niche value n, one can write 

p{k,m) = J dnp{k,m,n) — J dnp{k\n)p{m\n) . (35) 

Using Eq. (35) in Eq. (32) and rearranging terms, yields 

1 

j dnknTfm, (36) 



1 

km 





with kn and m„ the average number of prey and of predators for a species with niche number n. 
One can calculate k^ by realizing that kn — Snx and Sx — 2z, so that 

kn — Snx — 2zn . (37) 



In the limits S 1 and C <^ 1, the potential predators of a species with niche value n are those 
species with niche values larger than n. Each of these species has a probability x of preying on 
it. Therefore, the average number of predators for a species with niche value n is 

rn;:^S{l- n)x = 2z{l - n) . (38) 
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Substituting Eqs. (37) and (38) into Eq. (36), one obtains km = 2z'^/3. Finally, using Eq.(27) 
and Eq.(29) in the limit C ^ one obtains 

(39) 



^5(1 + 3^) 



In Fig. 8(A), we show the probability density of model realized rcor- The figure demonstrates 
that fluctuations from the average value are not negligible for 5* as large as 100. In Fig. 8(B) we 
compare Eq.(39) with numerical simulations of the niche model, finding good agreement in the 
limits of interest, S ^ 1 and C ^ 1. The two figures confirm our initial hypothesis that the 
niche model generates webs with negative correlations between k and m. 



2.8 Assortativity 



The assortativity of a food web quantifies the correlation between the number of links— trophi c 
interactions — of a species and the number of links of its prey and predators dNewmam l2002h . 
In a food web with positive assortativity — an assortative food web — highly connected species 
tend to be connected to one another. Conversely, in a food web with negative assortativity — 
a dis assortative food web — hig hly connected sp ecies tend to connect to species with a small 
number of links. Prior work bv iNewmanI (l2002h suggests that assortativity tends to be positive 
for social networks and negative for technological and biological networks. In this subsection 
we evaluate the assortativity of the food webs generated by the niche model in the limit of large 
S and small C. 



Let us consider a trophic interaction in the network, in which the predator has r trophic links 
and the prey has r' trophic links. The assortativity A is defined as (iNewmanL 120021) 



A 



{rr') - 



(40) 



where (...) denotes the average over all trophic links in the food web. 

We first calculate (rr'). In section 2.7 we obtained the average number of prey and the average 
number of predators for a species with niche number n, namely kn = 2zn, and rn:^ = 2z{l — n). 
Thus the average number of links of a species with niche number is = kn + fn^ = 2z, 
which is independent of n. It follows then that the average number of links is the same for all 
species, independently of the niche number. Therefore, r and r' are independent variables, so 
that (rr') = (r) (r'). 

In order to evaluate these averages, we transform the averages over links, (■ ■ ■), to averages over 
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species, • • •. Let us call the number of links of the predator in link i, by definition (r) is 

(r) = ^liB^ili , (41) 



where L is the number of links in the network. The predator from link i — denoted as species 
j — has kj prey, so that it appears as the predator in kj links. Therefore, one can write 

|/^^ = ^species kj Tj ^ ^^^^ 



Since rj = kj + rrij, the numerator can be rewritten as S{k'^ + km), with the overbar denoting 
an average over species, as used in previous sections. Finally, one has 



k^ + km 

(r) = , (43) 



Analogously, one can obtain 



. + km ^^^^ 
{r) = , (44) 



2, k^ + k m'^ + 3 k'^m 



{n = — ^ ^ , (45) 



m 



^ + k'^m + 2) km? 



in = — ^ ^ . (46) 



These averages can be evaluated as in section 2.7 (see Appendix A). Substituting Eqs. (43)-(46) 
into Eq. (40), one obtains 

(2^/3-1/2)2 



26^79 + ^/3 + 1/4 



In Fig. 9, we compare this expression with numerical simulations and find good agreement in 
the limits for which our derivation is valid. 



2.9 Robustness of the analytical results 



In Table 1 we summarize the analytical results obtained for the niche model. An important 
question is that of the robustness of our predictions to changes in the particular formulation of 
the details of the model. The approximations used in the derivations of the expressions for the 
distributions of the number of prey and predators, Eqs. (9) and (14), allow us to conclude that: 
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Fig. 9. (A) Distribution of the assortativity A obtained from 5000 realizations of the niche model. We 
show simulation results for z = 10. (B) Comparison of the average assortativity obtained from 5000 
simulations of the model (for which isolated species where removed) with our theoretical prediction, 
Eq. (47). As predicted by Eq. (47), the niche model yields webs that are slightly disassortative in the 
limit of large S. For small S, food webs become slightly assortative at large linkage densities. 



Property 



Expression 



Distribution of number of prey 
Distribution of number of predators 
Fraction of top species 
Fraction of basal species 
Fraction of cannibals 

Standard deviation of the vulnerability 
Standard deviation of the generaUty 
Prey-predators correlation coefficient 

Assortativity 



Ppreyik) = {l/2z) Eiik/2z) 

PpredM = (1/2^) 7("i + 1, 22;) 

rp 1— exp {—2z) 

T3 _ In (1+22) 
° ~ 2z 

5= 2i^i(l,l;2C;l/2)-l 
« C + 2C2 + 0{C^) 
av = ^1/3+ lA 



<TG = V8/(3 + 6C) - 1 

r = 1 

V5(i+3/2) 

. ^ (2^/3-1/2)2 

26^2/9+2/3+1/4 



Table 1 

Summary of the analytical expressions obtained for the niche model in the limits 5* » 1 and C <^1. 

(1) The distribution of number of prey depends on the functional form of p{x), but Eq. (9) will 
still be obtained for any p{x) decaying exponentially as C (or x) tends to zero. 

(2) The distribution of number of predators does not depend on the specific form of p{x). The 
only requirement is that the connectance C — x/2 tends to zero under some limit, so that 
2z — SC remains finite when 5" — > oo. 
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Thus, it is safe to conclude that our findings are robust under quite general conditions, a result 
that is not possible to obtain without an analytic treatment of the problem. Moreover, as we will 
show in the next section, the quantitative patterns uncovered for the niche model (Table 1) are 
important guides for the study of empirical food webs. 



3 Patterns in food web structure 

In this section, we compare the predictions of our theoretical solution of the niche model with 
data from empirical food webs obtained from a variety of environments. Remarkably, we find 
that the quantitative patterns uncovered for the niche model also describe the properties of food 
webs pertaining to very diverse habitats, such as freshwater, marine-freshwater interfaces, and 
terrestrial ecosystems. 



3.1 Cumulative distributions 

The reported empirical food webs generally contain a small number of trophic species. This fact 
implies that the empirical distributions of the number of prey and number of predators will be 
quite noisy. For this reason, we consider here the cumulative distributions instead of probability 
density functions. 

We first derive an analytical expression for the cumulative distribution of number of prey, 
^prey(> k) = J2k'>kPprey{k'), in the limit of large sizes and small connectances. Since l/2z is 
generally quite smaller than 1, one can substitute the sum by an integral with negligible error, 
yielding 




(48) 



which, similarly to Eq. (9), predicts an exponential decay for large k. In terms of the scaled 
variable k = k/2z,'we obtain 




(49) 



which, as Eq. (9), contains no free parameters. 

Next, we derive an analytical expression for the cumulative distribution of number of predators. 
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In the limits S ^ 1 and C <^ 1, one can use Eq. (14) to obtain 

-j^ oo 

^'pred(> = ^ E + 1, 2^) , 

m'=m 



(50) 



As we have already noted, for m < 2z the gamma function can be approximated as 7(m + 
1, 2z) ^ 1. We can then rewrite Eq. (50) as 



Ppred(> m) = 1 - — l{rn' + l,2z) 

m'=0 



m <2z 



(51) 



For m > 2z, Eq. (51) decays as the error function ( Gradstheyn and Rvzhik . 2000l) . 



Next, we analyze the empirical data for fifteen food webs with 25 to 155 trophic species. These 
webs have linka ge densities 1 .6 < z < 17.7, a nd connectances in the interval 0.026-0.315 
( Williams and MartinezlbOOol: Dunne et al. , 2002 ). We first investigate the distributions of num- 
ber of prey and number of predators. In Figs. 10-12 we compare the cumulative distributions of 
the number of prey, number of predators, and number of trophic links for species in the fifteen 
food web investigated with the numerical predictions of the model for their specific values of S 
and z. 



To quantify the agreement between the empirical data and the model, we apply the Kolmogorov- 
Smirnov (KS) test to the empirical distributions — prey, predator, and number of trophic links — 
and 1000 realizations of the model. The average output of the KS test is plotted in Fig. 13. We 
regard Pks ^ 0.01 — shown in black in Fig. 13 — as strong evidence for the rejection of the 
null hypothesis. Our results suggest that eleven of the fifteen food webs studied are well ap- 
proximated by the niche model: Bridge Brook, Skipwith, Coachella, Caribbean Reef, Benguela, 
St. Martin, Shelf, Chesapeake, St. Marks, Little Rock, and Grassland. The remaining four — El 
Verde, Canton, Ythan, and Stony Stream — exhibit rather different behavior, which is visually 
apparent in Figs. 10-12 and confirmed by the results in Fig. 13. 

Equations (48)-(51) and the results of Figs. 10-13 suggest the possibility that, to first approx- 
imation, Pprey and Ppred obcy universal functional forms that depend only on z. In order to 
further investigate whether these patterns of similarity are true, as well as if these distributions 
are universal, we perform the Kolmogorov-Smirnov test between all pairs of food webs. To 
do this we take advantage of the scaling determined earlier and use the scaled variables as the 
basis for comparison. Equation (49) predicts that Pprcy(> k) depends only on k/2z. The scal- 
ing of Pprcd(> fn) is not as straightforward. As discussed before, "true" scaling holds only for 
'm/2z < 1/2, while for larger values of m/2z there is a Gaussian decay of the probability func- 
tion with an explicit dependence on z. However, the decay for m > 2; is quite fast and, to first 
approximation, not very relevant. 

We show the results of these tests in Fig. 14. We use the same rejection criterion as before. The 
results of Fig. 14 also support the hypothesis that there are eleven food webs that conform to the 
theoretical predictions. 
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Fig. 10. Cu mulative distrib ution -Pprey of n umber k of pre y for the fifteen fo od webs stud ied: Bridge 
Broo k Lake jHavensL[l992h : S kipwith Pond dWarrenL fTosi) : Co achella Vallev JPoUsLll99lh: Caribbea n 
Reef (' Opit7L h996'v Benguela {Y^^\l29^; St. M artin Isl and ('Ooldwasser and Roughgardenl. ll993h : 
Northeast US Shelf (Link, 2002); Chesapeake Bay teaird and Ulan owicz> .1989): St. Marks Seagrass 
(IChristian and Luczk ovich, 19 99); Little Rock L ake (Martinez, 1 991); Grassland ( Martinez et al.. 1999) : 
El V erde R ainforest Jwaide and ReaganL 1996h: Canton Creek dPownsend et al.lll998l) : Ythan Estuary 
JhTiI and Raffaelli,'l99l'); and Stony Stream ( Town send et al.l The solid black line represents the 

average value from 1000 simulations of the niche model and the grey region represents two standard 
deviations above and below the model's predictions. 
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Number of predators, m Number of predators, m Number of predators, m 

Fig. 11. Cumulative distribution Pprcd of number m of predators for the fifteen food webs studied (see 
Fig. 10). The soUd black line represents the average value from 1000 simulations of the niche model and 
the grey region represents two standard deviations above and below the model's predictions. 

For this reason, from this point on we will focus our attention upon these eleven food webs — 
Bridge Brook, Skipwith, Coachella, Reef, Benguela, St. Martin, Shelf, Chesapeake, St. Marks, 
Little Rock, and Grassland. 

We plot in Figs. 15(A) and (B) the cumulative distributions Pprcy (> k) versus the scaled variable 
k/2z for the eleven similar food webs and find that the data collapse onto a single curve, again 
supporting the possibility that -Pprey obeys a universal functional form. Similarly we plot -Ppred(> 
m) versus the scaled variable m/2z for the eleven similar food webs in Figs. 15(B) and (C), 
again finding a collapse of the data onto a single curve for m/2z < 0.5. 
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Fig. 12. Cumulative distribution Punk of the number r of trophic links for the fifteen food webs studied 
(see Fig. 10). The solid black line represents the average value from 1000 simulations of the niche model 
and the grey region represents two standard deviations above and below the model's predictions. 



Figure 15 again supports the strong new hypothesis that the distributions of number of prey 
and number of predators follow universal functional forms. To improve statistics and better 
investigate the specific functional form of these distributions, one may pool the scaled variables, 
k/2z and m/2z, from the different webs into single distributions, -Pprcy and Pprcd^ respectively. 
Figures 16(A) and (B) show the cumulative distributions of scaled number of prey and scaled 
number of predators for the pooled webs. Note that the distributions are well approximated by 
Eqs. (49) and (50) even though there are no free parameters to fit in the analytical curves. These 
results a re analogous to the finding of scaling and unive rsality in physical, chemical, and social 
systems (IStanlevL[l97llll999HBunde and Havlin[ll994h . 
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Little Rock LR 
Grassland Gr 
El Verde Ve 
Canton Ca 
Ythan Yt 
Stony Stream SS 

Fig. 13. Comparison of the distributions of prey, predators, and total number of links of the fifteen food 
webs to the respective distributions obtained from 1000 webs generated by the niche model. We use the 
KS test for the comparison. Column one is the result for distribution of prey, column two the result for 
distribution of predators, and column three the result for distribution of total number of links. We regard 
Pks < 0.01 — shown in black — as strong evidence for the rejection of the null hypothesis. 




In Fig. 16 we plot the probability densities for the distribution of number of prey and number of 
predators for the pooled webs. It is visually apparent that both distributions are different. This is 
confirmed by the Kolmogorov-Smimov test which rejects the null hypothesis at the 0.2% level. 
The distribution of number of prey decays exponentially while the distribution of number of 
predators is essentially a step function. 



One can perform a similar analysis for the distribution punk of the number of trophic links r. As 
for number of prey or number of predators, the data from the different webs, upon the scaling 
r/2z, collapse onto a single curve, further supporting the hypothesis that scaling holds for food 
web structure. To better determine the specific functional form of piink('^)j in Fig. 16(D) we pool 
the scaled variables, r/22;, from the eleven food webs into a single distribution. We find that 
Piink('") has an exponential decay for ^ 1, in agreement with our theoretical calculations. 
Therefore, there is a characteristic scale for the linkage density, i.e. food webs do not have 
a scale-free structure, in contrast to erroneous repo rts in recent studies of food-web fragility 
dSole and MontovaLEooHlMontova and SoleLl2002l) . 



The analytical expression for the distribution of the total number of links, Eq. (16), was derived 
assuming that the number of prey and number of predators are uncorrelated. In Fig. 17 we 
compare the empirical rcor with data generated from simulations of the niche model. It can be 
seen that, just as in the niche model, food webs show, on average, slightly negative correlation 
between number of prey and number of predators. It is also important to note that for both the 
empirical data and the niche model, the is a wide range of possible correlation coefficients which 
may occur. 
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Fig. 14. Comparison of the distributions of prey, predators, and total number of links of the fifteen em- 
pirical food webs. We use the KS test for the comparison. The resulting matrices may be interpreted as 
similarity matrices with values < Pks < 1> the KS probability. We show the matrix for distribution 
of prey k in the top left, the matrix for distribution of predators m in the top right, and the matrix for 
distribution of total number of links r in the bottom left. We regard Pks < 0.01 — shown in black — as 
strong evidence for the rejection of the null hypothesis. 



3.2 Assortativity 



It has been reported that technological and biological networks are most comm only disassortive , 
in contrast to social networks which are most commonly assortative in nature (INewmanL l2002h . 
We analyze the assortativities of the fifteen empirical food webs investigated and compare this 
data to numerical simulations of the model. We confirm that food webs are mostly disassortative. 
However, the assortativities cover a wide range of values from moderately disassortative to 
slightly assortative. The numerical results from simulations of the niche model also show a 
similarly wide range of possible values, as shown in Fig. 18. 
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Fig. 15. Visual test of the "scaling hypothesis" that the distributions of number of prey, predators, and 
trophic links have the same functional form for different food webs. (A) Cumulative distribution Pprey 
of the scaled number of prey k/2z for the eleven food webs. The solid line is the prediction of Eq. (48). 
The data "collapse" onto a single curve that agrees well with the analytical results. (B) Cumulative 
distribution Ppred of the scaled number of predators m/2z for the eleven webs. The solid line is the 
analytical prediction of Eq. (50) for the average value of z in the empirical data, z = 8.44 (C) Cumulative 
distribution Pymks of the scaled number of predators r/2z for the eleven webs. The solid line is the 
analytical prediction. Semi-logarithmic plot of the scaled distributions of (D) number of prey, (E) number 
of predators, and (F) total number of links. The symbols are those introduced in Fig. 10. 



3.3 Cannibalism 



We determine the fraction of cannibal species as a function of the connectance C for the empir- 
ical webs considered, and compare it with the predictions of the niche and the random model 
analyzed in Appendix B. We find that the analytical expression for cannibalism supplied by the 
niche model captures the behavior of all fifteen food webs we study. In contrast, the predictions 
of the random model, shown in Fig. 19(B), cannot capture the behavior of Coachella or the 
Caribbean Reef webs. 

It is observed that the empirical values are generally smaller than the predictions. Indeed, some 
food webs have no cannibals at all. Such low ratios of cannibalism is seen by some authors 
as unrealisti c, and possibly due to the large effort needed to build a comprehensive food web 
( PolisLll99lh . 
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Fig. 16. Cumulative distributions (A) Pprey of the scaled number of prey, kjlz, and (B) Ppred of the 
scaled number of predators, ■m/2z, for the eleven pooled webs. The solid lines are, respectively, the 
analytical predictions Eq. (49) and (50), the latter with the average value z = 8.44. (C) Cumulative 
distribution of the number of trophic interactions per species r = A; + m for the eleven pooled webs. The 
soUd Une is obtained by numerically convolving the distributions Eq. (49) and (50) while the dashed line 
is obtained by numerical simulations of the niche model for S" = 511 and z = 8.44, the parameter values 
of the pooled disttibutions. The tail of the distribution decays exponentially, indicating that food webs do 
not have a scale-free structure. 
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Fig. 17. Correlation coefficient between number of prey and number of predators for the fifteen food 
webs investigated. Probability density plot for the correlation coefficients obtained empirically compared 
to the probability density of rcor from 5000 realizations of the niche model at the average properties of 
the empirical webs studied, S* = 64 and z = 8.06. Note the good agreement between data and model 
predictions. The white boxes represent the eleven food webs which are well described by our analytical 
expressions and the niche model while the grey boxes represent the four food webs which are not. 
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Fig. 18. Assortativity of the fifteen food webs investigated. Probability density plot for the assortativities 
obtained empirically compared to the probability density of assortativity from 5000 realizations of the 
niche model at the average properties of the empirical webs studied, namely 5 = 64 and z = 8.06. 
The food webs are, on average, slightly disassortative; however, they cover a wide range of values from 
moderately disassortative to slightly assortative and all of the se behaviors are captured in the predictions 
of the model. Assortativities were calculated as outlined in ( Newman! 2002h . As in Fig. 17, the white 
boxes represent the eleven food webs which are well described by our analytical expressions and the 
niche model while the grey boxes represent the four food webs which are not. 



3.4 Network theory mesaures 



Next, we investigate if the scaling hypothesis suggested by the analysis of distribution of trophic 
links also applies to other quantities characterizing food web structure. We consi der two quanti- 
ties w ith ecologic implications: (i) the average trophic distance between species ( Watts and Strogat3. 



19981) . which is defined as the typical number of spe cies needed to t rophic ally connect two 



given species, and (ii) the clustering coefficient C ( Watts and Strogat3. 1998[) which quantifies 
the fraction of species triplets that form fully-connected triangles. 

In Fig. 20(A) we compare our numer ical results for the average trophic distance d for the niche 
model ( Williams and Martinez',"2000^ with the values calculated for the food webs analyzed. We 
observe that there is agreement between the niche model and the empirical data. Remarkably, 
the behavior predicted by the model also holds for randomization of the empirical data, where 
the randomization of links between species is such that the distributions of number of predators 
and number of prey remain the same. We also find that d increases with web size as log S both 
for the model and for the data. 



The results of Fig. 20(A) also support the scaling hypothesis and suggest that the average dis- 
tance in a food web may also follow a unique functional form for different food webs. This 
feature is predicted remarkably well by the niche model, even for the four webs for which the 
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Connectance, C 

Fig. 19. Fraction of cannibal species as a function of the connectance C for empirical webs relative to 
(A) the niche model and (B) the random model. Thick solid Unes correspond to model predictions Eqs. 
(24) and (B.l), respectively. The shaded area is within two standard deviations from the model average. 
One observes that the empirical results are in good agreement with the niche model. The dashed lines 
represent the best fit quadratic (A) and linear (B) expressions for the empirical data. The symbols are 
those introduced in Fig. 10 except that the four poorly approximated food webs are filled in grey. 

model fails to describe other topological properties. 

Figure 20(B) shows our results for the clustering coefficient C of the food webs studied and for 
the niche model. We find that the data is well approximated by the model predictions which 
show that C decreases to zero as l/S" as web size 5" increases. This result does not hold for 
Grassland — which has a much higher value than predicted — nor for Canton or Stony Stream — 
which feature much lower values than predicted. As for the average distance d, the niche model 
accurately predicts the values of the clustering coefficient even after the randomization of the 
empirical food webs. Again, this suggests that there are robust features inherent in the simple 
framework of the niche model which allows it to describe well the most complete empirical data 
available. 



4 Concluding remarks 

The major finding of this study is the uncovering of unifying quantitative patterns character- 
izing the structure of food webs from diverse environments. Specifically, we find that, for the 
majority of the most complete empirical food webs, the distributions of the number of prey, 
number of predators, and number of trophic links obey universal scaling functions, where the 
scaling quantity is the linkage density. Remarkably, these scaling functions are consistent with 
analytical predictions we derived for the niche model. Therefore, our results suggest that these 
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Fig. 20. (A) Scaled average trophic distance d between sp ecies versus linkage density z . We compare 
the data with the numerical simulations of the niche model ( Williams and Martinez','2000') for web : 



sizes 



S = 100, 500, 1000 (thin solid lines). We find a logarithmic increase of the average distance with web size 
S for the empirical food webs, in good agreement with the model predictions. (B) Double-logarithmic 
plot of the clustering coefficient C versus the scaled web si ze S/2z. We compare the data with numerical 
results for the niche model ( Williams and M artinez. '2000*) for three values of the linkage density in the 
empirically-relevant range (z = 2.5, 5, 10). We find that the clustering coefficient of the food webs is 
inversely proportional to the web size S, in good a greement with the model p redictions and with the 
asymptotic behavior predicted for a random graph ( Watts and Strogat3. 1998h . For both (A) and (B) 
the grey circles represent the average values calculated from 1000 randomizations of the links of the 
empirical food webs keeping the same distributions of number of prey and number of predators. Note 
that the behavior of these randomized webs is still captured quite well by the niche model implying 
that the underlying distributions themselves are responsible for this behavior. The symbols are those 
introduced in Fig. 10 except that the four poorly approximated food webs are filled in grey. 

distributions can be theoretically predicted merely by knowing the food web's linkage density, 
a parameter readily accessible empirically. 

Our results are of interest for a number of reasons. First, the results of Fig. 20 — which also 
support the sc aling hypothesis — indicate that there is very little, if any, compartmentalization 
in ecosystems ( Pimm and Lawtonl 1980l) . suggesting the possibility that ecosystems are highly 
interconnected and that the removal of any species may induce large disturbances. Second, reg- 
ularities such as these are interesting as descriptors of trophic interactions inside communities 
because they may enable us to make predictions in the absence of high-quality data, and pro- 
vide insight into how ecologic communities function and are assembled. Third, the structure 
of food webs is different from that of many other biological networks in two important as- 
pects: the links are uni-directional and the in- and out-degree distribution are different. These 
two facts are a result of the directed character of the trophic interactions and of the asymme- 
try it creates. Interestingly, the niche model captures this asymmetry in its rules, which may 
explain its success in explaining the empirical results. Fourth, food webs do not have a scale 
free distribution of number of links (total, prey, or predators). This may be viewed as surprising 
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since one could expect most species to try to prey on the most abundant species in the ecosys- 
tem (an "abundant-get-eaten" kind of mechanism). Such a preferential attachment would lead 
to a scale-free distribution of links; instead, we find a single-scale distribution, suggesting that 
species specialize and prey on a small set of other species. 



We do not intend to provide here a detailed study of why four of the fifteen food webs we 
consider do not accomodate to the patterns we report. We will, nonetheless, offer a few re- 
marks on this matter. The four webs that do not conform to our patterns are Ythan Estuary, 
Canton Creek, Stony Stream, and El Verde Rainforest. Let us first consider the Ythan Estuary 
web. It has been noted already that this web displays an over-representation of top bird species 
( Williams and MartinezL 2000[) which could account for the differences. For Canton Creek and 
Stony Stream two aspects distinguish them from the other food webs. First, they are quantita- 
tively quite similar. This fact is illustrated by noting that the Kolmogorov-Smimov test provides 
Pks of 0.565, 0.045, and 0.794 under direct comparison of their respective distributions of num- 
ber of prey, number of predators, and total number of links. Upon further investigation this easily 
explained as the original authors stated plainly that it was their intention to compi le food webs 
from habitats that were as similar as possible to each other (Townsend et al. Ill998h. Second, the 



Canton Creek and Stony Stream webs are also time specific (Townsen d et al.Lll998|) -meaning 
their data was collected solely on one occasion — as opposed to cumulative — which are based 
upon data collected on multiple occasion until reaching some state of "completeness." This 
time specific nature not only distinguishes them from all of the other food webs which we have 
studied, but also implies that they contain rather different information and are not directly com- 
parable to cumulative food webs in our framework. We lastly address the El Verde Rainforest 
food web by pointing out that over one third of the links were not observered in the field but 
rather are based upon iteractions involving closely related sp ecies in the forest or obse rvations 
or published accounts of the interaction outside of the forest ( Waide and Reagan . 1996h . 



To conclude, we want to stress that our findings are remarkable for two main reasons: (i) they 
hold for eleven out of fifteen of the most complete food webs studied, in contrast to previously 
reported patterns, and (ii) they support the conclusion that fundamental concepts of modern 
statistical physics such as scaling and universality — which were developed for the study of 
inanimate systems — may b e succesfull y applied in th e study o f food webs — which comprise 
animate beings ( Camacho eTZi 12002 Al: ICiarl aschel 1 i eTZi 12003). Indeed, our results are con- 
sistent with the underlying hypothesis of scaling theory, i.e., food webs display "universal" 
patterns in the way trophic relations are established despite apparently "fundamental" differ- 
ences in factors such as the environment (e.g. marine versus terrestrial), ecosystem assembly, 
and past history. 
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A Variables 



Model Variable 

variable definition 

S Number of trophic species 

L Total number of trophic links 

C Directed connectance, C = L/S"^ 

z Linkage density, z = L/S 

n Niche number in niche model, n E [0, 1] 

a Range of predation in niche model 

X Beta-distributed random variable 

b Characteristic parameter for beta-distribution of x 



Patterns in food web structure, J. Camacho et al. 



37 



Ecological 


Variable 


variable 


definition 


k 


Numbpr of nrpv 


k 


Scaled number of prey, k = kjlz 


Ul 


Number of oredators 


ifi 


Scalpd niimhpr of nrpdators ?t? — th rl7 


T 


Total niimhpr of tronViip linlc^ t — -1- to 


f 


Scaled total number of trophic links, f = t/2z 


T 


Fraction of ton SDCcies 


B 


Fraction of ha<sal sr>pcip<s 


I 


Fraction of intermediate species 


9 


rraciion oi cdnniudis 


Oy 


Standard deviation of vulnerability 




Standard deviation of generality 


'"cor 


Correlation coefficient between species number 




of prey and number of predators 


A 


Assortativity 



B Fraction of cannibal species for a random linking model 



In this Appendix we calculate the fraction of cannibal species for a random model, i.e. a model 
where the links between species are placed at random. For a system with 5" species and z average 
number of links per species, the probability that a species feeds on itself will be 



fl'rand 




(B.l) 



The standard deviation of ^^rand is obtained from Eq. (26) by using g = gfrand^ yielding 



/ fi'rand(l fl'rand) /r> o\ 



In the limit of small C, the predictions of both models are similar. Note that this occurs re- 
gardless of the particular form of Px{x). In fact, C — > is equivalent to x — > 0, so that Px{x) 
takes non- vanishing values only for very small x. One can then neglect x/2 versus 1 in the 
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denominator of Eq. (23) and get 

/X /2 "x 
Px{x) ^_ ,^ dx -^C = g^rand ■ (B.3) 

' 

Therefore, only the second term in the right hand side of expression Eq. (24) depends on the 
specific form of Px{x). 



C Calculation of the moments of the distributions of number of prey and number of 
predators 



In this Appendix we calculate averages of the type k^rri>, with i and j integers, in the limit 
of large web sizes and small connectances. These averages are used in the calculation of the 
assortativity, in Section 2.8. 

In Section 2.7, we obtained Eq. (35) for the join probability p{k, m), namely 

p{k,m) — / dnp{k,m,n) — / dnp{k\n)p{m\n) . (C.l) 



With this, the average we want to calculate writes 

k^m^ = j dn i'^k^p{k\n)\ i m^p{m\n) J = jdnkl^mi,. (C.2) 



\k=0 / \m=0 



The probability p{k\n) of a species with niche value n to have k prey can be calculated as 
follows. According to the rules of the niche model, in order to have k prey, the parameter x 
must take the value x — k/Sn. Then, by performing a change of variables, one obtains 

where p^ is given by Eq. (1). In the limit of small C, p^ can be approximated by 

Px{x) ~ 6exp(-6x) ~ 2Cexp(-a;/2C). (C.4) 



Then, p{k\n)i& given by 

p(,|„) = _i_exp(-A). (C.5) 
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The average values kj^ can be evaluated, in the limit of small C, as 

oo 1 I 



In order to evaluate mli we need p{m\n). In the limit S* ^ 1 and C ^ 1, the number of 
potencial predators of a species of niche value n (that is, the number of species with a niche 
number larger than n) is S{1 — n). As seen in Section 2.3, each of them has a probability x to 
prey on the species. Thus, the probability that a species with niche parameter n has m predators 
is simply the binomial distribution. 



p{m\n) = i^^^ ""Ax^il-xf^^-^^-"^, 
\ m J 


(C.7) 


that in our limit yields the Poisson distribution, namely 




, , , A™exp(-A) 
p{m\n) = ^1 


(C.8) 


with 




A = S{1 - n)x = 2z{l - n) . 


(C.9) 


One can now obtain the moments 






(C.IO) 


^ = A' + A 


(C.ll) 


^ = A^ + SA" + A . 


(C.12) 


Finally, using Eq. (C.6), (C.9) and (C.10)-(C.12) into Eq. (C.l), yields 




k — rn — z , 


(C.13) 


F = 8^73 , 


(C.14) 


F = 12z^ , 


(C.15) 


rr?^z + Az^/3 , 


(C.16) 


= z + Az"^ + 2z^ , 


(C.17) 


km^ 2^73, 


(C.18) 
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^77?= 2^2(^1 + ^)/3, (C.19) 

and 

Fm = 4^73. (C.20) 



